Overview

Dataset Statistics

Number of Variables 23
Number of Rows 3840312
Missing Cells 5877356
Missing Cells (%) 6.7%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 875.7 MB
Average Row Size in Memory 239.1 B
Variable Types
  • Numerical: 22
  • Categorical: 1
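
The summary percentages above can be re-derived from figures given elsewhere in this report. The sketch below is an arithmetic check, not the profiler's own code. The row count is the MONTHS_BALANCE negatives count (reported as 100.0% of rows), and the per-column missing counts come from the "Missing" entries under Dataset Insights. Reading "MB" as binary megabytes (2**20 bytes) is an assumption; it is the reading that reproduces the reported average row size.

```python
# Figures copied from this report.
N_ROWS = 3_840_312
N_VARIABLES = 23
MISSING_PER_COLUMN = {
    "AMT_DRAWINGS_ATM_CURRENT": 749_816,
    "AMT_DRAWINGS_OTHER_CURRENT": 749_816,
    "AMT_DRAWINGS_POS_CURRENT": 749_816,
    "AMT_INST_MIN_REGULARITY": 305_236,
    "AMT_PAYMENT_CURRENT": 767_988,
    "CNT_DRAWINGS_ATM_CURRENT": 749_816,
    "CNT_DRAWINGS_OTHER_CURRENT": 749_816,
    "CNT_DRAWINGS_POS_CURRENT": 749_816,
    "CNT_INSTALMENT_MATURE_CUM": 305_236,
}
TOTAL_SIZE_MB = 875.7  # assumption: binary megabytes (2**20 bytes each)

# Missing cells are summed over columns and divided by rows * columns.
missing_cells = sum(MISSING_PER_COLUMN.values())
missing_pct = 100 * missing_cells / (N_ROWS * N_VARIABLES)

# Average row size is total memory divided by the number of rows.
avg_row_bytes = TOTAL_SIZE_MB * 2**20 / N_ROWS

print(missing_cells)            # total missing cells
print(round(missing_pct, 1))    # missing cells as a percentage
print(round(avg_row_bytes, 1))  # bytes per row
```

Under these assumptions the results agree with the reported 6.7% missing cells and 239.1 B per row.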

Dataset Insights

Similar Distribution
  • AMT_DRAWINGS_OTHER_CURRENT and CNT_DRAWINGS_OTHER_CURRENT have similar distributions
  • AMT_RECIVABLE and AMT_TOTAL_RECEIVABLE have similar distributions
  • SK_DPD and SK_DPD_DEF have similar distributions
Missing
  • AMT_DRAWINGS_ATM_CURRENT has 749816 (19.52%) missing values
  • AMT_DRAWINGS_OTHER_CURRENT has 749816 (19.52%) missing values
  • AMT_DRAWINGS_POS_CURRENT has 749816 (19.52%) missing values
  • AMT_INST_MIN_REGULARITY has 305236 (7.95%) missing values
  • AMT_PAYMENT_CURRENT has 767988 (20.0%) missing values
  • CNT_DRAWINGS_ATM_CURRENT has 749816 (19.52%) missing values
  • CNT_DRAWINGS_OTHER_CURRENT has 749816 (19.52%) missing values
  • CNT_DRAWINGS_POS_CURRENT has 749816 (19.52%) missing values
  • CNT_INSTALMENT_MATURE_CUM has 305236 (7.95%) missing values
Skewed
  • AMT_BALANCE is skewed
  • AMT_CREDIT_LIMIT_ACTUAL is skewed
  • AMT_DRAWINGS_ATM_CURRENT is skewed
  • AMT_DRAWINGS_CURRENT is skewed
  • AMT_DRAWINGS_OTHER_CURRENT is skewed
  • AMT_DRAWINGS_POS_CURRENT is skewed
  • AMT_INST_MIN_REGULARITY is skewed
  • AMT_PAYMENT_CURRENT is skewed
  • AMT_PAYMENT_TOTAL_CURRENT is skewed
  • AMT_RECEIVABLE_PRINCIPAL is skewed
  • AMT_RECIVABLE is skewed
  • AMT_TOTAL_RECEIVABLE is skewed
  • CNT_DRAWINGS_ATM_CURRENT is skewed
  • CNT_DRAWINGS_CURRENT is skewed
  • CNT_DRAWINGS_OTHER_CURRENT is skewed
  • CNT_DRAWINGS_POS_CURRENT is skewed
  • CNT_INSTALMENT_MATURE_CUM is skewed
  • SK_DPD is skewed
  • SK_DPD_DEF is skewed
Negatives
  • MONTHS_BALANCE has 3840312 (100.0%) negatives
  • AMT_RECIVABLE has 109338 (2.85%) negatives
  • AMT_TOTAL_RECEIVABLE has 109330 (2.85%) negatives
Zeros
  • AMT_BALANCE has 2156420 (56.15%) zeros
  • AMT_CREDIT_LIMIT_ACTUAL has 753823 (19.63%) zeros
  • AMT_DRAWINGS_ATM_CURRENT has 2665718 (69.41%) zeros
  • AMT_DRAWINGS_CURRENT has 3223443 (83.94%) zeros
  • AMT_DRAWINGS_OTHER_CURRENT has 3078163 (80.15%) zeros
  • AMT_DRAWINGS_POS_CURRENT has 2825595 (73.58%) zeros
  • AMT_INST_MIN_REGULARITY has 1928864 (50.23%) zeros
  • AMT_PAYMENT_CURRENT has 390507 (10.17%) zeros
  • AMT_PAYMENT_TOTAL_CURRENT has 2172223 (56.56%) zeros
  • AMT_RECEIVABLE_PRINCIPAL has 2296167 (59.79%) zeros
  • AMT_RECIVABLE has 2113816 (55.04%) zeros
  • AMT_TOTAL_RECEIVABLE has 2113643 (55.04%) zeros
  • CNT_DRAWINGS_ATM_CURRENT has 2665718 (69.41%) zeros
  • CNT_DRAWINGS_CURRENT has 3229952 (84.11%) zeros
  • CNT_DRAWINGS_OTHER_CURRENT has 3077688 (80.14%) zeros
  • CNT_DRAWINGS_POS_CURRENT has 2825594 (73.58%) zeros
  • CNT_INSTALMENT_MATURE_CUM has 551467 (14.36%) zeros
  • SK_DPD has 3686957 (96.01%) zeros
  • SK_DPD_DEF has 3750972 (97.67%) zeros

Variables


SK_ID_PREV

numerical

Approximate Distinct Count 104307
Approximate Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 1.9045e+06
Minimum 1000018
Maximum 2843496
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • SK_ID_PREV is skewed right (γ1 = 0.0384)

Quantile Statistics

Minimum 1000018
5-th Percentile 1.0827e+06
Q1 1.4355e+06
Median 1.8978e+06
Q3 2.3703e+06
95-th Percentile 2.7493e+06
Maximum 2843496
Range 1843478
IQR 934790.25
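
For readers who want to reproduce a quantile block like the one above, here is a minimal sketch. It assumes percentiles are computed by linear interpolation between order statistics; the report does not state its method, though the fractional IQR above suggests some form of interpolation. The helper names and the toy `sample` are hypothetical.

```python
def percentile(values, q):
    """Return the q-th quantile (0 <= q <= 1) by linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    position = (len(ordered) - 1) * q          # fractional index into the sorted data
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def quantile_summary(values):
    """Build the same fields as a 'Quantile Statistics' block."""
    q1 = percentile(values, 0.25)
    q3 = percentile(values, 0.75)
    return {
        "minimum": min(values),
        "p5": percentile(values, 0.05),
        "q1": q1,
        "median": percentile(values, 0.50),
        "q3": q3,
        "p95": percentile(values, 0.95),
        "maximum": max(values),
        "range": max(values) - min(values),  # maximum minus minimum
        "iqr": q3 - q1,                      # interquartile range
    }


# Hypothetical toy column, not taken from the dataset.
sample = [1, 2, 3, 4, 5]
summary = quantile_summary(sample)
print(summary)
```

The same two identities can be checked by hand on any block in this report: Range is Maximum minus Minimum, and IQR is Q3 minus Q1.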

Descriptive Statistics

Mean 1.9045e+06
Standard Deviation 536469.4706
Variance 2.878e+11
Sum 7.3139e+12
Skewness 0.03839
Kurtosis -1.2196
Coefficient of Variation 0.2817
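
The descriptive statistics above are related by simple formulas, sketched below. Two assumptions are made because the report does not state them: population (divide-by-n) moments are used, and "Kurtosis" is taken to mean excess kurtosis. The second is consistent with the value near -1.2 reported for this evenly spread identifier, since a uniform distribution has excess kurtosis -1.2. The coefficient of variation is the standard deviation divided by the mean, which matches the figures above (536469.4706 / 1.9045e+06 ≈ 0.2817). The function names and toy `sample` are hypothetical.

```python
import math


def central_moment(values, order):
    """Population central moment of the given order."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** order for v in values) / len(values)


def describe(values):
    """Return the fields of a 'Descriptive Statistics' block.

    Raises ZeroDivisionError for a constant column (zero variance)
    or a column whose mean is zero.
    """
    mean = sum(values) / len(values)
    variance = central_moment(values, 2)  # population variance (assumption)
    std = math.sqrt(variance)
    return {
        "mean": mean,
        "std": std,
        "variance": variance,
        "sum": sum(values),
        "skewness": central_moment(values, 3) / variance ** 1.5,
        "kurtosis": central_moment(values, 4) / variance ** 2 - 3,  # excess kurtosis
        "cv": std / mean,  # coefficient of variation
    }


# Hypothetical toy column: mostly zeros with one large value, like many columns here.
sample = [0, 0, 0, 0, 10]
stats = describe(sample)
print(stats)
```

Because the coefficient of variation divides by the mean, it is negative for a column whose mean is negative, as MONTHS_BALANCE shows later in this report.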

SK_ID_CURR

numerical

Approximate Distinct Count 103558
Approximate Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 278324.2073
Minimum 100006
Maximum 456250
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • SK_ID_CURR is skewed left (γ1 = -0.0018)

Quantile Statistics

Minimum 100006
5-th Percentile 118358
Q1 189836.5
Median 278600
Q3 367766
95-th Percentile 438443
Maximum 456250
Range 356244
IQR 177929.5

Descriptive Statistics

Mean 278324.2073
Standard Deviation 102704.4751
Variance 1.0548e+10
Sum 1.0689e+12
Skewness -0.001834
Kurtosis -1.1995
Coefficient of Variation 0.369
  • SK_ID_CURR is not normally distributed (p-value 2.596987723444231e-05)
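
The report does not say which normality test produced the p-values shown in these notes. Purely as an illustration of how such a p-value can be derived from skewness and kurtosis, the sketch below uses the Jarque-Bera test, which may well differ from the test the profiler applies. The function name and toy `sample` are hypothetical.

```python
import math


def jarque_bera(values):
    """Jarque-Bera normality test: returns (statistic, p_value).

    The statistic is asymptotically chi-squared with 2 degrees of freedom
    under normality, so the p-value is exp(-statistic / 2). The
    approximation is unreliable for small samples.
    """
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skewness = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3
    statistic = n / 6 * (skewness ** 2 + excess_kurtosis ** 2 / 4)
    p_value = math.exp(-statistic / 2)  # chi-squared survival function, 2 df
    return statistic, p_value


# Hypothetical toy column, not taken from the dataset.
sample = [0, 0, 0, 0, 10]
statistic, p_value = jarque_bera(sample)
print(statistic, p_value)
```

A small p-value is evidence against normality. With millions of rows, even mild departures such as the flat distribution of an identifier column give small p-values.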

MONTHS_BALANCE

numerical

Approximate Distinct Count 96
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean -34.5219
Minimum -96
Maximum -1
Zeros 0
Zeros (%) 0.0%
Negatives 3840312
Negatives (%) 100.0%
  • MONTHS_BALANCE is skewed left (γ1 = -0.598)

Quantile Statistics

Minimum -96
5-th Percentile -83
Q1 -55
Median -27
Q3 -11
95-th Percentile -3
Maximum -1
Range 95
IQR 44

Descriptive Statistics

Mean -34.5219
Standard Deviation 26.6678
Variance 711.1689
Sum -1.3257e+08
Skewness -0.598
Kurtosis -0.8562
Coefficient of Variation -0.7725
  • MONTHS_BALANCE is not normally distributed (p-value 0.0007426410851260633)

AMT_BALANCE

numerical

Approximate Distinct Count 1347904
Approximate Unique (%) 35.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 58300.1553
Minimum -420250.185
Maximum 1.5059e+06
Zeros 2156420
Zeros (%) 56.1%
Negatives 2345
Negatives (%) 0.1%
  • AMT_BALANCE is skewed right (γ1 = 2.9202)

Quantile Statistics

Minimum -420250.185
5-th Percentile 0
Q1 0
Median 0
Q3 90627.4912
95-th Percentile 265747.635
Maximum 1.5059e+06
Range 1.9262e+06
IQR 90627.4912

Descriptive Statistics

Mean 58300.1553
Standard Deviation 106307.031
Variance 1.1301e+10
Sum 2.2389e+11
Skewness 2.9202
Kurtosis 11.7787
Coefficient of Variation 1.8234
  • AMT_BALANCE is not normally distributed (p-value 7.05895024821181e-24)
  • AMT_BALANCE has 240018 outliers

AMT_CREDIT_LIMIT_ACTUAL

numerical

Approximate Distinct Count 181
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 153807.9574
Minimum 0
Maximum 1350000
Zeros 753823
Zeros (%) 19.6%
Negatives 0
Negatives (%) 0.0%
  • AMT_CREDIT_LIMIT_ACTUAL is skewed right (γ1 = 2.0597)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 45000
Median 135000
Q3 180000
95-th Percentile 450000
Maximum 1350000
Range 1350000
IQR 135000

Descriptive Statistics

Mean 153807.9574
Standard Deviation 165145.6995
Variance 2.7273e+10
Sum 5.9067e+11
Skewness 2.0597
Kurtosis 5.184
Coefficient of Variation 1.0737
  • AMT_CREDIT_LIMIT_ACTUAL is not normally distributed (p-value 2.3958670492742633e-10)
  • AMT_CREDIT_LIMIT_ACTUAL has 404927 outliers

AMT_DRAWINGS_ATM_CURRENT

numerical

Approximate Distinct Count 2267
Approximate Unique (%) 0.1%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 5961.3248
Minimum -6827.31
Maximum 2.115e+06
Zeros 2665718
Zeros (%) 69.4%
Negatives 1
Negatives (%) 0.0%
  • AMT_DRAWINGS_ATM_CURRENT is skewed right (γ1 = 9.6648)

Quantile Statistics

Minimum -6827.31
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 36000
Maximum 2.115e+06
Range 2.1218e+06
IQR 0

Descriptive Statistics

Mean 5961.3248
Standard Deviation 28225.6886
Variance 7.9669e+08
Sum 1.8423e+10
Skewness 9.6648
Kurtosis 164.9273
Coefficient of Variation 4.7348
  • AMT_DRAWINGS_ATM_CURRENT is not normally distributed (p-value 4.4272042101211345e-25)
  • AMT_DRAWINGS_ATM_CURRENT has 424778 outliers

AMT_DRAWINGS_CURRENT

numerical

Approximate Distinct Count 187005
Approximate Unique (%) 4.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 7433.3882
Minimum -6211.62
Maximum 2.2871e+06
Zeros 3223443
Zeros (%) 83.9%
Negatives 3
Negatives (%) 0.0%
  • AMT_DRAWINGS_CURRENT is skewed right (γ1 = 10.0656)

Quantile Statistics

Minimum -6211.62
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 45000
Maximum 2.2871e+06
Range 2.2933e+06
IQR 0

Descriptive Statistics

Mean 7433.3882
Standard Deviation 33846.0773
Variance 1.1456e+09
Sum 2.8547e+10
Skewness 10.0656
Kurtosis 184.2742
Coefficient of Variation 4.5533
  • AMT_DRAWINGS_CURRENT is not normally distributed (p-value 4.474256021940704e-25)
  • AMT_DRAWINGS_CURRENT has 616869 outliers
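
The outlier count above is consistent with an IQR-fence rule. Q1 and Q3 are both 0 for this column, so the fences collapse to 0 and every non-zero value falls outside them: 3840312 rows minus 3223443 zeros gives 616869, the reported number of outliers. The sketch below shows that rule. The 1.5 multiplier (Tukey's convention) and the interpolated percentiles are assumptions, since the report does not document its outlier rule. Helper names and the toy `sample` are hypothetical.

```python
def percentile(values, q):
    """Return the q-th quantile (0 <= q <= 1) by linear interpolation."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (position - lower) * (ordered[upper] - ordered[lower])


def iqr_outliers(values, multiplier=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (k = multiplier)."""
    q1 = percentile(values, 0.25)
    q3 = percentile(values, 0.75)
    spread = q3 - q1
    low, high = q1 - multiplier * spread, q3 + multiplier * spread
    return [v for v in values if v < low or v > high]


# Hypothetical zero-heavy column: IQR is 0, so every non-zero value is flagged.
sample = [0] * 8 + [5, 7]
outliers = iqr_outliers(sample)
print(outliers)

# Arithmetic check against the figures reported for AMT_DRAWINGS_CURRENT.
rows, zeros, reported_outliers = 3_840_312, 3_223_443, 616_869
nonzero_rows = rows - zeros
print(nonzero_rows == reported_outliers)
```

For zero-inflated columns like this one, the outlier count therefore mostly restates the share of non-zero rows rather than identifying unusual values.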

AMT_DRAWINGS_OTHER_CURRENT

numerical

Approximate Distinct Count 1832
Approximate Unique (%) 0.1%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 288.1696
Minimum 0
Maximum 1.5298e+06
Zeros 3078163
Zeros (%) 80.2%
Negatives 0
Negatives (%) 0.0%
  • AMT_DRAWINGS_OTHER_CURRENT is skewed right (γ1 = 50.5703)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 1.5298e+06
Range 1.5298e+06
IQR 0

Descriptive Statistics

Mean 288.1696
Standard Deviation 8201.9893
Variance 6.7273e+07
Sum 8.9059e+08
Skewness 50.5703
Kurtosis 3628.004
Coefficient of Variation 28.4624
  • AMT_DRAWINGS_OTHER_CURRENT is not normally distributed (p-value 4.226655099641841e-25)

AMT_DRAWINGS_POS_CURRENT

numerical

Approximate Distinct Count 168748
Approximate Unique (%) 5.5%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 2968.8048
Minimum 0
Maximum 2.2393e+06
Zeros 2825595
Zeros (%) 73.6%
Negatives 0
Negatives (%) 0.0%
  • AMT_DRAWINGS_POS_CURRENT is skewed right (γ1 = 19.4211)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 11250
Maximum 2.2393e+06
Range 2.2393e+06
IQR 0

Descriptive Statistics

Mean 2968.8048
Standard Deviation 20796.887
Variance 4.3251e+08
Sum 9.1751e+09
Skewness 19.4211
Kurtosis 713.9872
Coefficient of Variation 7.0051
  • AMT_DRAWINGS_POS_CURRENT is not normally distributed (p-value 4.258058858926164e-25)
  • AMT_DRAWINGS_POS_CURRENT has 264901 outliers

AMT_INST_MIN_REGULARITY

numerical

Approximate Distinct Count 312266
Approximate Unique (%) 8.8%
Missing 305236
Missing (%) 7.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 56561216
Mean 3540.2041
Minimum 0
Maximum 202882.005
Zeros 1928864
Zeros (%) 50.2%
Negatives 0
Negatives (%) 0.0%
  • AMT_INST_MIN_REGULARITY is skewed right (γ1 = 2.4944)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 6750
95-th Percentile 13642.515
Maximum 202882.005
Range 202882.005
IQR 6750

Descriptive Statistics

Mean 3540.2041
Standard Deviation 5600.1541
Variance 3.1362e+07
Sum 1.2515e+10
Skewness 2.4944
Kurtosis 10.1825
Coefficient of Variation 1.5819
  • AMT_INST_MIN_REGULARITY is not normally distributed (p-value 2.69803435070876e-23)
  • AMT_INST_MIN_REGULARITY has 121953 outliers

AMT_PAYMENT_CURRENT

numerical

Approximate Distinct Count 163209
Approximate Unique (%) 5.3%
Missing 767988
Missing (%) 20.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 49157184
Mean 10280.5377
Minimum 0
Maximum 4.2892e+06
Zeros 390507
Zeros (%) 10.2%
Negatives 0
Negatives (%) 0.0%
  • AMT_PAYMENT_CURRENT is skewed right (γ1 = 12.9906)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 157.5
Median 2921.49
Q3 9000
95-th Percentile 34200
Maximum 4.2892e+06
Range 4.2892e+06
IQR 8842.5

Descriptive Statistics

Mean 10280.5377
Standard Deviation 36078.085
Variance 1.3016e+09
Sum 3.1585e+10
Skewness 12.9906
Kurtosis 315.7564
Coefficient of Variation 3.5094
  • AMT_PAYMENT_CURRENT is not normally distributed (p-value 4.259655491614744e-25)
  • AMT_PAYMENT_CURRENT has 282489 outliers

AMT_PAYMENT_TOTAL_CURRENT

numerical

Approximate Distinct Count 182957
Approximate Unique (%) 4.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 7588.8567
Minimum 0
Maximum 4.2783e+06
Zeros 2172223
Zeros (%) 56.6%
Negatives 0
Negatives (%) 0.0%
  • AMT_PAYMENT_TOTAL_CURRENT is skewed right (γ1 = 14.4797)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 6750
95-th Percentile 24559.3237
Maximum 4.2783e+06
Range 4.2783e+06
IQR 6750

Descriptive Statistics

Mean 7588.8567
Standard Deviation 32005.9878
Variance 1.0244e+09
Sum 2.9144e+10
Skewness 14.4797
Kurtosis 393.2553
Coefficient of Variation 4.2175
  • AMT_PAYMENT_TOTAL_CURRENT is not normally distributed (p-value 4.24632524959144e-25)
  • AMT_PAYMENT_TOTAL_CURRENT has 324742 outliers

AMT_RECEIVABLE_PRINCIPAL

numerical

Approximate Distinct Count 1195839
Approximate Unique (%) 31.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 55965.8769
Minimum -423305.82
Maximum 1.4723e+06
Zeros 2296167
Zeros (%) 59.8%
Negatives 2428
Negatives (%) 0.1%
  • AMT_RECEIVABLE_PRINCIPAL is skewed right (γ1 = 2.9423)

Quantile Statistics

Minimum -423305.82
5-th Percentile 0
Q1 0
Median 0
Q3 86906.745
95-th Percentile 256500
Maximum 1.4723e+06
Range 1.8956e+06
IQR 86906.745

Descriptive Statistics

Mean 55965.8769
Standard Deviation 102533.6168
Variance 1.0513e+10
Sum 2.1493e+11
Skewness 2.9423
Kurtosis 11.9584
Coefficient of Variation 1.8321
  • AMT_RECEIVABLE_PRINCIPAL is not normally distributed (p-value 4.9717662981849306e-24)
  • AMT_RECEIVABLE_PRINCIPAL has 243330 outliers

AMT_RECIVABLE

numerical

Approximate Distinct Count 1338878
Approximate Unique (%) 34.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 58088.8112
Minimum -420250.185
Maximum 1.4933e+06
Zeros 2113816
Zeros (%) 55.0%
Negatives 109338
Negatives (%) 2.8%
  • AMT_RECIVABLE is skewed right (γ1 = 2.9132)

Quantile Statistics

Minimum -420250.185
5-th Percentile 0
Q1 0
Median 0
Q3 90359.5387
95-th Percentile 265181.202
Maximum 1.4933e+06
Range 1.9136e+06
IQR 90359.5387

Descriptive Statistics

Mean 58088.8112
Standard Deviation 105965.3699
Variance 1.1229e+10
Sum 2.2308e+11
Skewness 2.9132
Kurtosis 11.7196
Coefficient of Variation 1.8242
  • AMT_RECIVABLE is not normally distributed (p-value 7.437982151271858e-24)
  • AMT_RECIVABLE has 239817 outliers

AMT_TOTAL_RECEIVABLE

numerical

Approximate Distinct Count 1339008
Approximate Unique (%) 34.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 58098.2855
Minimum -420250.185
Maximum 1.4933e+06
Zeros 2113643
Zeros (%) 55.0%
Negatives 109330
Negatives (%) 2.8%
  • AMT_TOTAL_RECEIVABLE is skewed right (γ1 = 2.9127)

Quantile Statistics

Minimum -420250.185
5-th Percentile 0
Q1 0
Median 0
Q3 90371.0925
95-th Percentile 265218.408
Maximum 1.4933e+06
Range 1.9136e+06
IQR 90371.0925

Descriptive Statistics

Mean 58098.2855
Standard Deviation 105971.8011
Variance 1.123e+10
Sum 2.2312e+11
Skewness 2.9127
Kurtosis 11.7159
Coefficient of Variation 1.824
  • AMT_TOTAL_RECEIVABLE is not normally distributed (p-value 7.443507082394071e-24)
  • AMT_TOTAL_RECEIVABLE has 239738 outliers

CNT_DRAWINGS_ATM_CURRENT

numerical

Approximate Distinct Count 44
Approximate Unique (%) 0.0%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 0.3094
Minimum 0
Maximum 51
Zeros 2665718
Zeros (%) 69.4%
Negatives 0
Negatives (%) 0.0%
  • CNT_DRAWINGS_ATM_CURRENT is skewed right (γ1 = 6.9067)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 2
Maximum 51
Range 51
IQR 0

Descriptive Statistics

Mean 0.3094
Standard Deviation 1.1004
Variance 1.2109
Sum 956351
Skewness 6.9067
Kurtosis 81.5492
Coefficient of Variation 3.556
  • CNT_DRAWINGS_ATM_CURRENT is not normally distributed (p-value 4.566313398661856e-25)
  • CNT_DRAWINGS_ATM_CURRENT has 424778 outliers

CNT_DRAWINGS_CURRENT

numerical

Approximate Distinct Count 129
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 0.7031
Minimum 0
Maximum 165
Zeros 3229952
Zeros (%) 84.1%
Negatives 0
Negatives (%) 0.0%
  • CNT_DRAWINGS_CURRENT is skewed right (γ1 = 10.6353)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 4
Maximum 165
Range 165
IQR 0

Descriptive Statistics

Mean 0.7031
Standard Deviation 3.1903
Variance 10.1783
Sum 2.7003e+06
Skewness 10.6353
Kurtosis 177.9281
Coefficient of Variation 4.5373
  • CNT_DRAWINGS_CURRENT is not normally distributed (p-value 4.411805328309986e-25)
  • CNT_DRAWINGS_CURRENT has 610360 outliers

CNT_DRAWINGS_OTHER_CURRENT

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 0.004812
Minimum 0
Maximum 12
Zeros 3077688
Zeros (%) 80.1%
Negatives 0
Negatives (%) 0.0%
  • CNT_DRAWINGS_OTHER_CURRENT is skewed right (γ1 = 26.3238)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 12
Range 12
IQR 0

Descriptive Statistics

Mean 0.004812
Standard Deviation 0.08264
Variance 0.006829
Sum 14873
Skewness 26.3238
Kurtosis 1253.2543
Coefficient of Variation 17.1717
  • CNT_DRAWINGS_OTHER_CURRENT is not normally distributed (p-value 4.2297032337627e-25)

CNT_DRAWINGS_POS_CURRENT

numerical

Approximate Distinct Count 133
Approximate Unique (%) 0.0%
Missing 749816
Missing (%) 19.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 49447936
Mean 0.5595
Minimum 0
Maximum 165
Zeros 2825594
Zeros (%) 73.6%
Negatives 0
Negatives (%) 0.0%
  • CNT_DRAWINGS_POS_CURRENT is skewed right (γ1 = 11.3526)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 3
Maximum 165
Range 165
IQR 0

Descriptive Statistics

Mean 0.5595
Standard Deviation 3.2406
Variance 10.5018
Sum 1.7291e+06
Skewness 11.3526
Kurtosis 192.5507
Coefficient of Variation 5.7923
  • CNT_DRAWINGS_POS_CURRENT is not normally distributed (p-value 4.309426629502191e-25)
  • CNT_DRAWINGS_POS_CURRENT has 264902 outliers

CNT_INSTALMENT_MATURE_CUM

numerical

Approximate Distinct Count 121
Approximate Unique (%) 0.0%
Missing 305236
Missing (%) 7.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 56561216
Mean 20.8251
Minimum 0
Maximum 120
Zeros 551467
Zeros (%) 14.4%
Negatives 0
Negatives (%) 0.0%
  • CNT_INSTALMENT_MATURE_CUM is skewed right (γ1 = 1.0756)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 4
Median 15
Q3 33
95-th Percentile 62
Maximum 120
Range 120
IQR 29

Descriptive Statistics

Mean 20.8251
Standard Deviation 20.0515
Variance 402.0624
Sum 7.3618e+07
Skewness 1.0756
Kurtosis 0.6403
Coefficient of Variation 0.9629
  • CNT_INSTALMENT_MATURE_CUM is not normally distributed (p-value 7.915170288874765e-17)
  • CNT_INSTALMENT_MATURE_CUM has 48938 outliers

NAME_CONTRACT_STATUS

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 273052524
  • The largest value (Active) is over 28.69 times larger than the second largest value (Completed)

Length

Mean 6.1017
Standard Deviation 0.5462
Median 6
Minimum 6
Maximum 13

Sample

1st row Active
2nd row Active
3rd row Active
4th row Active
5th row Active

Letter

Count 23431731
Lowercase Letter 19591419
Space Separator 513
Uppercase Letter 3840312
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Active, Completed) take over 50.0%
  • The largest value (active) is over 28.69 times larger than the second largest value (completed)
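
The two category notes above (largest-to-second-largest ratio and top-2 share) can be computed from value counts, as in the minimal sketch below. The report gives only the ratio, not the per-category counts, so the toy `sample` and the function name are hypothetical.

```python
from collections import Counter


def dominance(values):
    """Summarise how strongly the most frequent categories dominate a column.

    Requires at least two distinct categories.
    """
    counts = Counter(values).most_common()
    if len(counts) < 2:
        raise ValueError("need at least two distinct categories")
    (top_label, top_count), (second_label, second_count) = counts[0], counts[1]
    return {
        "top": top_label,
        "second": second_label,
        "ratio": top_count / second_count,                      # largest vs second largest
        "top2_share": (top_count + second_count) / len(values),  # fraction of all rows
    }


# Hypothetical toy column, not the real category counts.
sample = ["Active"] * 6 + ["Completed"] * 2 + ["Signed"]
result = dominance(sample)
print(result)
```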

SK_DPD

numerical

Approximate Distinct Count 917
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 9.2837
Minimum 0
Maximum 3260
Zeros 3686957
Zeros (%) 96.0%
Negatives 0
Negatives (%) 0.0%
  • SK_DPD is skewed right (γ1 = 12.947)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 3260
Range 3260
IQR 0

Descriptive Statistics

Mean 9.2837
Standard Deviation 97.5157
Variance 9509.3118
Sum 3.5652e+07
Skewness 12.947
Kurtosis 190.3724
Coefficient of Variation 10.504
  • SK_DPD is not normally distributed (p-value 4.2274816549030155e-25)
  • SK_DPD has 153355 outliers

SK_DPD_DEF

numerical

Approximate Distinct Count 378
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 61444992
Mean 0.3316
Minimum 0
Maximum 3260
Zeros 3750972
Zeros (%) 97.7%
Negatives 0
Negatives (%) 0.0%
  • SK_DPD_DEF is skewed right (γ1 = 89.8304)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 3260
Range 3260
IQR 0

Descriptive Statistics

Mean 0.3316
Standard Deviation 21.4792
Variance 461.3574
Sum 1.2735e+06
Skewness 89.8304
Kurtosis 9007.7273
Coefficient of Variation 64.7702
  • SK_DPD_DEF is not normally distributed (p-value 4.226514402593701e-25)
  • SK_DPD_DEF has 89340 outliers

Interactions

Correlations

Missing Values